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ABSTRACT 



We present a new technique for estimating the distance to young open clusters. 
The method requires accurate measurement of the axial rotation period of late-type 
members of the cluster: rotation periods are first combined with projected rotation 
velocities and an estimate of the angular diameter for each star - obtained using 
the Barnes-Evans relation between colour and surface brightness. A 'best' cluster 
distance estimate is then determined using standard techniques from the theory of 
order statistics, which are in common use in the general statistics literature. It is hoped 
that this new method will prove a useful adjoint to more traditional distance methods, 
in order to better ascertain the distance scale within the solar neighbourhood. 



Key vifords: 

stars: distances 
general. 



stars: late-type - stars: statistics - open clusters and associations: 



1 INTRODUCTION 

The accurate measurement of distances on intergalactic and 
, extragalactic scales poses a problem of critical importance 
' to both astronomical and cosmological research. There ex- 
ists in common use a variety of different distance indica- 
tors; bridging and relating these indicators has enabled the 
construction of a cosmological distance ladder, the precise 
, calibration of which is heavily reliant upon several key dis- 

' tances. 

Of[ 



fnnda,menta,l importa.nce is the di.stance determina,- 



tion of nearby galactic clusters; for example the Hyades dis- 
tance is a crucial 'yardstick' upon which the calibration of 
the extragalactic distance scale and ultimately the Hubble 
constant depend. 

In this paper we describe a new technique for estimating 
the distance to nearby galactic clusters. The method is ap- 
plicable to young clusters containing fast rotating late type 
stars, the periods of which may be determined by measur- 
ing rotational modulation due to surface inhomogeneities. 
Prime examples of such clusters are the Pleiades and a Per- 
sei, which are the next step beyond the Hyades in the stan- 
dard zero-age main sequence (ZAMS) fitting procedure. 

The idea of using stellar rotation periods to esti- 



mate cluster distances was pr eviously discussed in CoL 
lier Cameron fc Woods (1992)| (hereafter CCW) although 



no detailed statistical analysis was attempted. In CCW it 
was observed that the measured rotation period, P, of a 
star may be combined with the projected rotational veloc- 



ity, V sin i, to determine an estimate of the star's 'projected' 
radius, Rsmi, viz:- 



i?sini = 



Pv sin i 
27r 



(1) 



Here the inclination, i, takes its usual definition; the angle 
between the rotation axis of the star and the line of sight. 

In CCW an estimate of the stellar angular diameter, (j), 
was then inferred for each star using the relation between 
colour and surf ace brightness derived by Barnes, _Evans & 



Moffett (1978) (hereafter BEM).( See also Section Q below). 



A 'projected' cluster distance, Ds'mi, then follows in the 
obvious way, viz;- 

2R sin i 



Ds'mi — 



(2) 



where (j) is measured in radians. Substituting from equation 
(1) we find;- 

Pv sin i 



D sin i = 

TT(p 

Or, expressing in more convenient units:- 
i3sin^ = 7.660 x 10-^ 



(3) 



(4) 



where now P is measured in hours, v sin i in kms ^ , in mil- 
liarcseconds and _Dsini in parsecs. This is essentially equa- 
tion (4) of CCW. 

D sin i values were thus derived in CCW for five stars in 
the a Per cluster. The largest value of Dsini was adopted 



© 0000 RAS 



2 M.A. Hendry, M.A O'Dell and A. Collier Cameron 



as a crude estimate of the cluster distance: this was based 
on the assumption that the largest observed D sin i should 
correspond to the star with the highest axial inclination, so 
that sini might reasonably be taken to be equal to unity - 
the maximum value in the distribution of sini. 

This simple approach did indeed give a cluster distance 
estimate which was in reasonable agreement, but greater 
than, t he distan ce modulus (m — M)o = 6.1 quoted in Btauf- 
fer et a ,. (1985)| . The limitations of the approach, however. 



stem from the assumption that the distribution of Dsini 
values arises solely from the intrinsic distribution of stel- 
lar inclinations, when in practice it results from a number 
of contributory factors: the intrinsic distribution of rotation 
periods, rotation velocities, inclinations and diameters - to- 
gether with the observational scatter in the measurement of 
each of these quantities. 

The aim of this paper is to extend and improve the sim- 
ple treatment of CCW by addressing these limitations, and 
thus place the estimation of cluster distances from stellar 
rotation periods on a more rigorous statistical footing. In 
Section I we begin by describing in more detail the prin- 
ciples behind the calibration and use of the Barnes-Evans 
relation, upon which our distance method relies. In Section 
^ we then carefully model the intrinsic scatter and observa- 
tional selection effects which contribute to the distribution 
of D sin i values. Such a detailed treatment is essential before 
one can meaningfully assign an error to any cluster distance 
estimate. 

Having thus derived the D sin i distribution expected 
for a given cluster, in Section ^ we consider several different 
distance estimators which one may define in terms of the ob- 
served 73 sini values. We investigate the properties of these 
estimators, demonstrating how one may assign the appro- 
priate distance error to each, and how they may be used to 
construct confidence intervals for the true cluster distance. 
In particular we show how one may improve upon the crude 
estimator adopted in CCW by combining the D sin i values 
inferred for all of the observed stars - instead of using simply 
the largest value. Finally, using artificially generated data, 
we examine how the accuracy of our new distance method 
varies with the number of observed cluster stars, and the in- 
trinsic scatter in the Barnes-Evans relation. Based on these 
results we assess the usefulness of our new method set along- 



side th( more traditional cluster distance indicators such as 
the ZAMS httmg procedure. In a torthcommg paper, cur- 
rently in preparation, we will apply our distance method to 
a larger sample of a Per stars and the Pleiades cluster for 



which lotation periods have been determined, and make a 
direct quantitative comparison between the cluster distance 
estimated by our method and that derived from ZAMS fit- 
ting. 



2 THE RELATION OF BARNES AND EVANS 
(STELLAR ANGULAR DIAMETERS VS 
VISUAL SURFACE BRIGHTNESS/COLOUR) 

In order to calculate aprojected cluster distance, D sini, we 
have seen in Section that it is necessary to estimate the 
angular diameter (j> of each star. This can be found to a lim- 
ited degree of a ccuracy by introducing the work of 



lar diameters to visual surface brightness and thus to colour 
index. 

This correlation had been known for some time. Similar 
wor k was carried out by 



Wesselink (1969), Warner (1972) 



and Harwood et al. (1975) but was limited basically to early 
type stars and the B-V colour index. The work of Barnes 
and Evans extended the correlation to the entire UBVRI 
system including red stars to spectral type M8. Their work 
was partly instigated by the sudden availability of many 
more stellar angular diameters found through a program 
observing the lunar occultations of nearby stars. 

Barnes and Evans defined a quantity the visual sur- 
face brightness parameter which can be shown to be:- 



Fv = 4.2207 - 0.114 - QMog<j) 



(5) 



where 14 is the unreddened apparent magnitude and <j) is the 
stellar angular diameter expressed in milliseconds of arc. 

Computing for the nearby calibration stars and then 
plotting the resultant values against the various colour in- 
dices; B-V, V-R and R-I (shown in BE) confirmed a linear 
relationship between these indices and surface brightness. 
(A limitation was imposed for B-V where the relation breaks 
down and the index no longer relates to the energy output 
of the star). In BEM Barnes, Evans and Moffett improved 
these relations by adding more stars and correcting for the 
effects of limb darkening. 

Of all the relationships the V-R vs F^ appears to have 
the tightest correlation but for the purposes of this paper 
we will follow the work of CCW and employ the B-V vs F^ 
relation which is linear over the spectral range of the a Per 
G-K dwarfs. 

From BEM the B-V vs Fy relation is linear over the 
interval:- 

Fv = 3.964 - 0.333(5 - V)o , -0.10<(B - F)o<1.35 (6) 

where {B — V)o is the unreddened colour index. 

Equating equation (5) (theoretically derived) to equa- 
tion (6) (empirically derived) and converting to reddened 
colour index and apparent magnitude gives the semi- 
empirical relation :- 

: 0.5134 - Qm&EB-v + 0.666(B ~V)~ 0.2V (7) 



assuming Ay ~ Z.OEb-v for the Perseus region Hiltner & 
Johnson (1956) (see equation (3), CCW) 

obtain cf> from a knowledge of B-V, 
for which an a verage value 



We can, therefore 
V and the colour excess Eb-v 

of 0.10 - 0.11 w as a dopted for the a Per region (Crawford & 



Barnes (1974) and Prosser (1991)) 



Barnc! 



fc Evans (1976) (hereafter BE) who correlated stellar angu- 



In effect we are working the original computation of 
Barnes and Evans backward to obtain from F^ given colour 
and apparent magnitude V . 

Most of the uncertainty in the stellar angular diameter 
cf> stems from the original lunar occultation measurements 
of the nearby calibration stars (typically 5 - 20 % (BE). 
More difficult to quantify (due to the paucity of calibration 
stars) is the uncertainty in the interpolation of the F^ vs 
B-V graph within the range of our G-K dwarfs. Finally to 
a lesser degree we must also consider the systematic errors 
inherent from the measurement of B-V, V and Eb-v and 
the approximation A„ ~ 3.0-Bs_v- 

The computed values of 4> generated from equation (6) 
are then substituted into equation (3) to obtain a Dsini 
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value for each star whose axial rotation period and projected 
equatorial velocity are known. Having generated a range of 
D sin i values we then obtain a number of different cluster 
distance estimates by interpreting the 'observed' distribu- 
tion of Dsini values according to the statistical methods 
described in Section 0- 



3 DISTRIBUTION OF DSINI 

In this section we construct a model for the distribution of 
D sin i values which one would expect to observe in a cluster 
at a given true distance. We will then later use this distri- 
bution as our basis for defining different cluster distance 
estimators and studying their properties. 

A general method for deriving this distribution may 
be found in any elementary statistics textbook. Firstly one 
adopts a model for the joint probability distribution of the 
observed rotation period, rotation velocity, inclination and 
angular diameter - the variables in terms of which D sin i is 
defined. This distribution must take account of the intrinsic 
spread in these variables from star to star, the scatter due to 
measurement error in the values observed for a given star, 
and any observational selection effects to which the sample 
may be subject. One next uses equation (3) to define an 
appropriate transformation of P, v, i and (f>, such that the 
distribution of D sin i may then be extracted directly from 
the joint distribution of the transformed variables. 

Although this method is straightforward to apply in 
principle, it rarely yields an analytic solution in practice, and 
that is indeed the case here. The problem is easily tackled 
via Monte Carlo simulations, however: i.e. we draw a large 
number of sets of the random variables, P, v, i and 0, and 
for each set compute the corresponding value of Dsini. We 
then deduce the distribution of D sin i by constructing a 
histogram of the computed values. 

Our analysis can be further simplified by making one 
additional assumption and a judicious change of variables, 
as we now show. 

Suppose that we observe a cluster which lies at true 
distance Dtmc parsecs. We can regard Dtruc as a fixed - 
though of course unknown - parameter, which we wish to 
estimate. Consider a star in the cluster and let Ptrue, vtme 
and 4)true denote the true rotation period, rotation velocity 
and angular diameter (in radians) respectively of this star. 
Observe that these variables are not mutually independent, 
but are related by the following equation:- 



Pt 



rue ''-'true 



71" (ptrue 



(8) 



Equation (8) assumes that the star lies precisely at the cen- 
tre of the cluster - i.e. we neglect any line of sight depth. 

Consider again equation (3) above, which expresses 
Dsini in terms of the observed values of P, v, i and (j>- 
Introducing two new variables, Zp = p°^" , and = , 
and combining with equations (3) and (8) we may write:- 



Dsini — 



Aruc (usini)obs Zp 



Dtr 



(9) 



This change of variables has two immediate advantages. 
Firstly, we no longer require to model the intrinsic distribu- 
tion of P and (j), only the observational scatter about their 



true values. One might reasonably expect these error dis- 
tributions to be independent of the value of Ptruc and ^truc 
respectively. Secondly, and more importantly, introducing Zp 
and Z0 simplifies the dependence of the D sin i distribution 
upon Dtruo- Clearly once we have modelled the distribution 
of the composite variable, a, in equation (9), we can obtain 
the distribution of D sin i for a cluster at any true distance 
simply by rescaling. One may think of a as a dimensionless 
projection factor, essentially equivalent to sin i. 

As an example. Figures |l] and |^ show probability den- 
sity curves for a, determined for a particular selection func- 
tion, and for different angular diameter error dispersions. 
The probability density curves were obtained by spline fit- 
ting to histograms constructed from 50000 trials. The Monte 
Carlo sampling was carried out as follows. 

(i) The true rotation velocity was first drawn from a uni- 
form distribution in the range to 240kms~^. For a star 
of one solar radius an equatorial velocity of 240kms~^ cor- 
responds to a rotational period of ~ 5 hours, which was 
the shorte st measured rotation period in th e a Per sample 
studied in 



O'DeU & Collier Cameron (1993) 



(ii) A true inclination was then assigned, based on the 
standard assumption that the orientation of the stellar ro- 
tation axis is completely rando m with no preferred direc- 
tion in space (Bernacca (1970)). ft follows easily from this 
assumption that the intrinsic distribution of cosi is uniform 
over the interval [0, 1]. 

(iii) The observed v sin i was assigned by multiplying 
(i)sini)true by a Gaussian of unit mean and dispersion of 
0.1; a relative error of 10% being typical for measurements 



of spec tral line broadening of cluster stars (Stauffer et al. 



(1985)) 



(iv) A lower selection limit of (i;sini)obs > 50kms~ was 
then imposed. This w as the limit adopted in selecting the a 
Per sample studied in O'DeU & Collier Cameron (1993) This 



selection function was primarily designed to ensure that only 
fast rotators were included in the sample - thus improving 
the chances of detecting rotational modulation within the 
available observing time. 

(v) Finally, the scaled variable, z^ was drawn from a 
gaussian of unit mean and constant dispersion, cr,^, where 
o"^ = 0.05 in Figure ^, and ct^ = 0.1 in Figure ^. Note that 
assigning a constant percentage error dispersion to the ob- 
served angular diameter was equivalent to a gaussian scatter 
of constant dispersion in the linear Barnes- Evans relation for 
logtj), as given by equation (7). 

(vi) In all cases the other scaled variable, Zp, was set iden- 
tically equal to unity: i.e. it was assumed that the true 
axial rotation period of each star was recovered exactly 
from fourier analysis of the light curve. This approximation 
seemed reasonable when the data quality was sufficiently 
high to ensure little ambiguity in the sta r's power spectrum. 



(See O'DeU & Collier Cameron (1993) for further discus- 
sion). In such a case the error in the derived period would 
be considerably smaller than the uncertainty in both (pobs 
and {v sin i)obs. 

It is important to note at this point that the lower selec- 
tion limit, (usini)obs > 50kms^^, also serves indirectly to 
exclude stars of low inclination (i < 12° ), since we are im- 
posing an upper limit on the true rotation velocity. We would 
certainly expect some selection of this kind to exist in our 
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Figure 1. Probability density function of the scaled variable, o, 
derived from a spline fit to the histogram obtained from 50000 
Monte Carlo trials. The dispersion of the angular diameter errors 
was assumed to be cr^ = 0.05. 
intrinsic distribution of sinitrue 
errors and selection effects. 



The dashed curve indicates the 
in the absence of observational 



Figure 2. Probability density function of the scaled variable, a, 
derived from a spline fit to the histogram obtained from 50000 
Monte Carlo trials. The dispersion of the angular diameter errors 
was assumed to be tr^ = 0.1. The dashed curve indicates the 
intrinsic distribution of sinitrue in the absence of observational 
errors and selection effects. 



sample, since stars viewed at or near to pole-on would not 
display significant rotational modulation, and hence their 
periods could not be measured photometrically. The exact 
form of this selection at higher inclinations will depend upon 
the distribution of surface inhomogeneities, and in a more 
rigorous treatment this dependence could be modelled ex- 
plicitly in our Monte Carlo sampling: i.e. for a given surface 
distribution we could determine the probability of detecting 
rotational modulation as a function of inclination. We do 
not incorporate such a model in this paper, however, since 
our aim is primarily to illustrate the essential features of 
our new distance method. Thus we adopt an inclination se- 
lection given simply by the limit of i < 12° , as indicated 
above. In our next paper we will consider the precise form 
of the inclination selection in more detail - together with the 
inverse problem of how one might use our cluster distance 



estimate to infer the distribution of surface inhomogeneities 
for the sampled stars. 

We can see from equation (9) that, if one has perfect, 
selection- free, measurements of v sin i and (j) then a is identi- 
cally distributed as sin itruc- Hence, in this ideal case, the a 
distribution rises monotonically from a = 0, peaks at a = 1 
and drops immediately to zero for all a > 1, as shown by the 
dashed curves in Figures |l| and |. Recall from section |l| that 
this behaviour motivated the choice in CCW of the largest 
D sin i as the cluster distance estimate. When we include the 
effects of measurement errors and observational selection on 
V sin i and (/>, however, we find that the observed distribu- 
tion of a - as indicated by the solid curves - is substantially 
different from the intrinsic distribution of sinitruc, and this 
difference reveals a serious weakness in the CCW approach. 
Firstly we can see that the a distribution no longer extends 
to a = - i.e. the sample is biased against stars of low incli- 
nation, as discussed above. More importantly, however, the 
distribution displays a significant tail for a > 1. Specifically, 
when £70 = 0.05 approximately 20% of the a distribution 
lies in the range a > 1. From equation (9), therefore, the 
probability equals ~ 0.2 that one would infer - for any given 
star - a Dsini greater than the true cluster distance. This 
probability increases to ~ 0.25 for cr^ = 0.1. Moreover, for 
the sample of five stars considered in CCW, the probability 
that the largest value of D sin i is greater than Dtruc in- 
creases to ~ 0.68 for (T0 = 0.05, and to ~ 0.75 for ct^ = 0.1. 
(These numbers follow easily from standard results on the 
distribution of order statistics, which we will introduce in 
Section^ below). Thus we see that the cluster distance de- 
rived in CCW is quite likely an over - estimate, as a result 
of the failure to model the errors and selection on v sin i and 
cf>. Note that the CCW estimate of the a Per distance mod- 
ulus was indeed somewhat larger than that quoted in e.g. 



Stauffer et al. (1985) 



Can one define a better estimate of the cluster distance? 
Having shown how one may model the Dsini distribution 
for any given cluster, we can now provide a quantitative 
answer to this question. 



4 CLUSTER DISTANCE ESTIMATORS 

Given a set of D sin i values inferred from our sample of clus- 
ter stars we may define a number of different cluster distance 
estimators by combining these Dsini values in various dif- 
ferent ways. In this section we define and compare the prop- 
erties of four such estimators, beginning with the estimator 
adopted in CCW. 



4.1 'Naive' Estimator, -Dnaive 

This estimator is simply the largest value of Dsini in our 
sample, as discussed previously. Since we ignore the effects 
of errors in vsini and (j> in defining this estimator, we refer 
to it as 'naive'. Suppose we observe a sample of n stars, and 
that we order the Dsmi values inferred for these stars in 
increasing size. Let {D sin ; fc = 1, n} denote the ordered 
sample, i.e.:- 



D sin 1(1) < Dsini(2) 



< 



< Dsmi 



(n) 



(10) 



Then I^naivc is defined simply as:- 
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D sin i 



(n) 



(11) 



Note that we are adopting the usual statistical convention 
of using a caret to denote an estimator of a parameter. We 
will return to the statistical properties of ordered samples 
later in this section. 



4.2 'Mean' Estimator, Ancan 

If we are to define a better distance estimator than Dnaivc 
then clearly we must take some account of the true shape 
of the D sin i distribution. One might also wish to make use 
of all the stars observed in the cluster, instead of just the 
star with the largest Dsini. One obvious way to do this is 
by using the mean D sin i of our sample. 

Consider again equation (9) above. If we take the mean 
of both sides (more formally, the expectation value over the 
_Dsini and a distributions at fixed true distance, Dtruo), 
then it follows trivially that:- 



<Dsini>= Dtruo <ce> 



(12) 



We thus define our estimator, Dmcan , in terms of the sample 
mean value of D sin i and < a > , the mean value of a as 
determined from our modelled distribution function, viz:- 



fc=i 



(13) 



An equivalent way of looking at this estimator is as follows: 
for each star individually we can derive a distance estimate, 
which is given by the cluster distance required so that the 
inferred D sin i for that particular star is equal to the mean 
value of Dsini at that distance. -Dmcan is then simply equal 
to the mean value of these individual distance estimates. 



4.3 'Median' Estimator, /)mcd 

We can derive another estimator based on the form of the a 
distribution - this time using the median value, [ajmcd. Since 
this distribution is asymmetric, as we can see from Figures ^ 
and ^, [a]mcd will not generally equal the mean value, <a>. 
Our distance estimate is defined as the true distance for 
which the median of our ordered sample of observed D sin i 
values is equal to the the median of the expected Dsini 
distribution at that distance. Hence we have:- 



-DmoH 



\D sin i]med 

[a]mcd 



(14) 



where [ajmed denotes the median of the modelled a dis- 



tributic n and (Dsinj)mcd denotes the median of the sam- 
pled Dsmi values. We can write down an expression tor 
[Dsini] mod in terms of the elements of our ordered sample, 
viz;- 



[D sin i] 



D sin i, n+i , 

2 ' 



n odd 



i(D sin i( n ) + D sin i 



We can use a Monte Carlo approach to deduce the distri- 
bution of each of these estimators, and hence compare their 
relative accuracy. We generate a large number of artificial 
cluster samples of a given size - each sample drawn from our 
modelled Dsini distribution for a cluster at some chosen 
true distance. For each sample we then compute the cluster 



distance estimate predicted by each of the three estimators 
defined above. We determine the distribution of each estima- 
tor at the specified true distance by constructing a histogram 
of the computed values. 

As an illustration, Figures ^ to ^ show a range of re- 
sults obtained from generating 2000 cluster samples - con- 
taining 10, 20 and 30 stars, and for angular diameter error 
dispersions of 5% and 10% respectively. In all cases the true 
cluster distance was taken to be 200 pc, as indicated by the 
dashed line on each histogram. The mean values and dis- 
persions of each estimator, as calculated directly from the 
histograms, are summarised in Table 0. 

A consistent picture of the properties of the estimators 
emerges from these results. In all cases the naive estimator 
is positively biased - i.e. the mean value of the estimator 
is systematically greater than the true distance of 200 pc. 
One would expect, therefore, Dnaive to systematically over- 
estimate the cluster distance, as was suggested in section 
(3). Contrary to what one might first expect, the magnitude 
of the bias of Dnaive in fact increases as the sample size in- 
creases: this is because, with a larger number of sampled 
stars, it becomes more likely that one will sample a star 
which lies further into the positive tail of the a distribution. 
Hence Dnaive would become a progressively poorer distance 
estimator as we add more stars to our cluster sample. In the 
most extreme example considered here, for a cluster sample 
of 30 stars and for cr^ = 0.1, the mean value of Dnaive is 
more than 250 pc - representing a positive bias of ~ 25% . 
It follows from the form of equation (9) that the percentage 
bias of Dnaive is independent of Dtme, for a given sample 
size, so this figure would hold good for a cluster at any true 
distance. The bias of Dnaive also increases roughly in pro- 
portion to the angular diameter dispersion, o^, which is as 
one might expect. 

The bias of Dmean and Dmed on the other hand is negli- 
gible, irrespective of the sample size, for the cases considered 
here. This clearly demonstrates the importance of properly 
accounting for the true form of the Dsini distribution in 
defining a good cluster distance estimate. From Table ^ we 
also see that the dispersion of Dmean is found to be slightly 
smaller than that of Dmed in all cases, so that the former 
is marginally the more accurate estimator. We defer further 
discussion of Table |l] until later. 

Having shown from our simulations that both Dmean 
and Dmed are better distance estimators than Dnaive, we 
now consider whether one may improve still further upon 
the former two estimators. 

In the case of a normal random variable, th e sample 
mean is an exa mple of a sufficient statistic (c.f. Mood & 



Graybill (1974)). Broadly speaking, this means that if one 
wishes to estimate the mean, 6, of the normal distribution 
from a sample, {xi; i = 1, . . . , n}, then the sample mean, 
<x>= — ^ Xi, provides precisely the same statistical infor- 
mation about 6 as does the complete set of sampled values 
{xi\. i.e. knowledge of each Xi would not improve our esti- 
mate of 6 compared with that obtained from knowledge of 
<x> alone. This property does not hold in general for ran- 
dom variables which are not normally distributed, however. 
We can clearly see from Figures |l] and ^ that the distribu- 
tion of a is highly non-Gaussian: one might expect, there- 
fore, that knowledge of the individual D sin i values in our 
sample - and in particular their ordering - would allow one 
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Figure 3. Histograms of naive, mean and median cluster distance estimator distributions, derived from 2000 samples of 10 stars at a 
true distance of 200 pc. Results are shown for for an angular diameter error dispersion of cr^ = 0.05 and 0.1 



to define a 'better' (in the sense of having a smaller dis- 
persion) distance estimator than -Dmean or D^^d- Before we 
introduce such an estimator we first make some preliminary 
remarks about the properties of order statistics. 



The use of order statistics is a common technique in 
applied statistics, and the subject is treated extensively in 



a number of textbooks and monographs (c.f David (1981) 



Gumbel (1958)). Suppose we draw a sample of size n from 



the distribution of our random variable, a. By the r or- 
der statistic, which we denote by a(r), we mean simply the 
r*^ smallest member of the sample. Hence a(„) denotes the 
largest sampled value: this is precisely the notation intro- 
duced in equation (10) above. 



The probability density distribution, pr(Q(r)), of the r*** 
order statistic is closely related to the distribution of the 
parent random variable, a, viz:- 



(r - l)\{n - r)\ 



where p^a^j.-^) and P(a!(^)) denote respectively the probabil- 
ity density and cumulative distribution functions of the par- 
ent random variable, a, in both cases evaluated at a = a{r)- 
Figure ^ illustrates the distribution of several of the a or- 
der statistics, for a sample of 10 stars drawn from the a 
distribution shown in Figure |l| - i.e. for = 0.05. We can 
understand qualitatively how the shape of these distribu- 
tions emerges from the form of equation (16). The distribu- 
tion of a(i) for example has a larger dispersion than that of 
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Figure 4. Histograms of naive, mean and median cluster distance estimator distributions, derived from 2000 samples of 20 stars at a 
true distance of 200 pc. Results are shown for for an angular diameter error dispersion of cr^ = 0.05 and 0.1 



the higher order statistics a(7) and q:(io) shown here. This 
is because the shape of pi(a(i)) is determined primarily by 
the behaviour of p{a) at small a, in which range the term 
[1 — P{<^] in equation (16) is close to unity. We can see from 
Figure hi that p{a) is somewhat flatter in this range than for 
a ~ 1, and this behaviour is reflected in the shape of the 
higher and lower order statistics distributions. 

Our aim is to use the properties of the aj^j distribu- 
tions to define a better distance estimator. The question of 
how best to combine some or all of the ordered D sin i values 
measured from our sample into one single distance estimate 
is non-trivial, however. One could, for example, derive a sep- 
arate distance estimate - analogous to -Dmean or -Dmed - from 
each ordered D sin i in turn, and take the mean of these indi- 



vidual estimates as our adopted cluster distance; the shapes 
of the order statistic distributions shown in Figure ^ suggest 
that it would be inappropriate to assign equal weight to each 
order, however. 

Rather than, for example, adopting some ad hoc weight- 
ing scheme to resolve this problem, we construct an 'or- 
dered' distance estimator, -Dord, which combines the mea- 
sured D sin i values by a different - and rather more elegant 
- method: one which accounts naturally for the shape of each 
order statistic distribution in its definition. 
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Figure 5. Histograms of naive, mean and median cluster distance estimator distributions, derived from 2000 samples of 30 stars at a 
true distance of 200 pc. Results are shown for for an angular diameter error dispersion of = 0.05 and 0.1 



4.4 'Ordered' Estimator, Doid 

Suppose we measure the value of D sin i(^) , for the r*^ star 

in our ordered sample. Given this measured value, the prob- 
ability distribution, pr(a(r)), of the r*'' order statistic of a 
might now equivalcntly be regarded as a probability distri- 
bution, Xr{D), for the true cluster distance, D, viz:- 



Ar(D) = Pr{a 



D sin i 



(r) 



(r)s 



D 



(16) 



This equation follows from equation (9) above. Xr{D) is re- 
ferred to as the likelihood function for D, given the measured 
value of D sin i(^) . Let A{D) denote the product of the like- 
lihood functions for all orders, r = 1, . . . , n. viz:- 



A{D) = l[Xr{D) 



(17) 



We define our ordered cluster distance estimate, Doid, as the 
value of D which maximises A(i)); i.e. Doid satisfies:- 



dA 



= 



(18) 



Since we do not have an analytic form for A{D) we can- 
not calculate -Dord directly by differentiation. It is, never- 
theless, straightforward to determine Dovd simply by com- 
puting A(_D) over a range of trial distances and finding the 
distance which yields the maximum value. 

It is instructive to compare the properties of .Dord with 
the other estimators which we have discussed, and discover 
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if it is indeed a 'better' estimator. We can do this again by 
using Monte Carlo simulations to construct the distribution 
of -Dord at a given true distance. 

As an illustration Figure ^ shows the results obtained 
from computing Dord for 2000 cluster samples at a true dis- 
tance of 200 pc. As in Figures ^to |5|, sample sizes of 10, 20 
and 30 stars and angular diameter errors of 5% and 10% are 
considered. A range of trial distances from 100 pc to 300 pc, 
at intervals of one pc, was tested. Given the spread in the 
distributions obtained, clearly a smaller step-size would have 
been redundant. 

Figure |^ demonstrates that -Dord is a slightly more accu- 
rate distance estimator than each of three estimators previ- 
ously considered: in all cases the bias of Dord is found to be 
negligible, and the dispersion appreciably smaller than that 
of both -Dmcan aud -Dmed- These results are also summarised 
in Table |^. We can see from Table ^ that the dispersion of 
-Dord is around 10% smaller than that of -Dmean in all cases. 
The improvement gained by using Dord appears to be greater 
for smaller sample sizes: i.e. there is less difference between 
-Dord and Dmoan for n = 30 than for n — 10. This would 
seem to be consistent with the central limit theorem, which 
requires that Dmean is asymptotically normally distributed 
(and hence a sufficient statistic) as n increases. It is also 
worth noting that the dispersion of all of the estimators de- 
creases more noticeably when one increases the sample size 



from n = 10 to n = 20, as compared with an increase from 
n = 20 to n = 30. I'urther studies bear out this trend for 
larger samples: beyond a sample size of n ~ 30 there is very 
little further gain in the accuracy of the distance estimators 
- and in particular Dord - by adding new stars to the cluster 
sample. 

The estimator dispersions reported in Table |l| provide 
a direct measure of the relative accuracy of each distance 
estimator at a true distance of 200 pc. In particular, we find 
that Dord is found to be accurate to ~ 7% (at the la level) 
in the worst case examined where we have only 10 stars and 
with (T0 — 0.1; this accuracy improves to slightly more than 
4% with a sample of 30 stars. For — 0.05 the accuracy 
of Dord improves from ~ 6% to ~ 3.5% as the sample size 
increases from n = 10 to n = 30. 



4.5 Properties of Distance Estimators: Summary 

The results of our Monte Carlo simulations - as summarised 
in Table |^ and illustrated in Figures ^ ^ |5[ and ^ - demon- 
strate that the ordered distance estimator, Dord, is the best 
of the four distance estimators which we have considered in 
this pa per. In all cases Dovh is unbiased, and has the small- 



est disp ersion. The naive estimator, Dnaivc, would clearly be 
a poor choice on the other hand. This estimator is positively 
biased, resulting from the naive model for the distribution 
of Dsini which is adopted in its definition. Moreover, the 
bias of Dnaive iucreases as the number of sampled stars in- 
creases. This property is particularly undesirable, since in 
the statistics literature one meets many biased estimators 
which are, nevertheless, consistent - meaning that their bias 
tends asymptotically to zero with increasing sample size. 
Dnaivc instead displays precisely the opposite behaviour. 

It is straightforward to compute the distributions of 
these estimators at a range of different true cluster distances, 
and the same qualitative results are found in all cases, thus 



consolidating our choice of Dord as best cluster distance es- 
timator. 

Moreover, this Monte Carlo approach clearly provides 
a simple means of assigning errors to cluster distance esti- 
mates derived from real data - as we have already indicated 
in discussing the results of Table ^ above. Suppose, for exam- 
ple, that we infer an ordered distance estimate of Dord = A 
from a given sample of real data . To calculate an error 
estimate we first generate a large number of random sam- 
ples, each equal in size to the observed sample and assuming 
Dtrno ~ A, and from a histogram of the distance estimates 
for these artificial samples we derive the distribution of Dord 
at this true distance. We then adopt the dispersion of this 
distribution as our error estimate for the cluster distance. 
In the same way we can use the distribution of Dord (or any 
other estimator) derived from our simulations to determine 
confidence intervals for the true cluster distance, given our 
estimated value. 

This approach is straightforward to implement, but 
does suffer from one specific technical loophole: this con- 
cerns the fact that we generate the distribution of our esti- 
mator assuming Dtmc ~ A, when it is not Dtmc but rather 
our estimated distance, Dord, which is equal to A. A more 
rigorous method for assigning errors and determining confi- 
dence intervals which overcomes precise lv this loophole does 
exist, a nd is described in detail in e.g. Hendry fc Simmons 
(1990)1 or iMood & Graybill (1974)1. In the present context. 



however, we find that the confidence intervals derived by 
the simple method outlined above are essentially identical 
to those derived by the more rigorous approach, and so we 
do not describe the latter method here. 

The smaller dispersion of Dord , as compared with Dmcan 
and Dmcd, illustrates that by ordering the sampled Dsini 
values one can frequently define a better cluster distance 
estimator; Dord being an example of one such estimator. 
There are a number of other methods by which one can 
use the properties of ordered samples to construct a clus- 
ter distance estimate. One appealing technique is to model 
the cumulative distribution function of Dsini for a clus- 
ter at a given true distance, and then construct a sample 
cumulative distribution function to be compared with the 
model distribution. One would adopt as the cluster distance 
estimate the distance which 'best fits' the modelled Dsini 
distribution to the sampled distribution. A suitable criterion 
for identifying the best fit might be, for example, the dis- 
tance which minimises the Kolmogorov-Smirnov statistic for 
the two cumulative distributions. This statistic is frequently 
used in problems of this type and is partic ularly robust to 



the form o f the underlying distribution (c.f. Kendall fc Stu-^ 
art (1963)1 ). We have investigated the use of this method in 



the present context, however, and find that it gives no better 
(and frequently worse) results than using Dord • The robust- 
ness of this approach would be particularly useful, however, 
if one's model for the distribution of Dsini were in some 
way uncertain or ambiguous, and we will investigate the use 
of more robust estimation techniques in future work. 
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Figure 6. Distribution of the order statistics, O(^), of the random variable a, with pdf as given by Figure 1, from a sample size of n : 
10. The r = l,r = 4, r = 7 and r = 10 order statistics are shown. 



Table 1. Mean value and dispersion of naive, mean, median and ordered cluster distance estimators, computed from 2000 simulations 
of cluster samples at a true distance of 200 pc. Results are given, in pc, for a sample size of n = 10, 20 and 30 stars, and for an angular 
diameter error dispersion of cr^ = 0.05 and 0.1 

(70 = 0.05 



<J^, = 0.1 





n = 


10 


n = 


20 


n = 


30 


D 


mean 


"i) 


mean 




mean 




^naive 


221.8 


16.5 


231.5 


14.8 


236.1 


13.4 




200.2 


14.6 


199.9 


10.0 


199.8 


8.5 




199.4 


16.2 


199.8 


11.6 


199.8 


9.8 


^ord 


199.9 


12.2 


199.9 


8.5 


199.9 


7.1 




n 


10 


n = 


20 


n = 


30 


b 


■ 




mean 


''b 


mean 




Dnaive 


232.8 


24.2 


246.3 


22.9 


253.3 


21.3 


Dmean 


200.3 


15.9 


199.8 


10.7 


199.7 


9.2 


Defied 


200.3 


17.6 


200.3 


12.5 


200.5 


10.7 


Dord 


200.1 


14.6 


200.1 


9.8 


200.2 


8.4 



5 BAYESIAN CLUSTER DISTANCE 
ESTIMATES 

In this section we briefly consider the estimation of the clus- 
ter distance as a problem in Bayesian inference. We will 
see that such a treatment is complementary to the analy- 
sis of distance estimators developed in section ^: indeed a 
Bayesian approach follows quite naturally from our chosen 
form for the ordered distance estimator. 



The basic elements which comprise a Bayesian treat- 
ment of the problem can be summarised as follows. We begin 
by postulating a prior distribution for the parameter which 
we wish to estimate - i.e. the true cluster distance, Dtruo. 
This distribution is supposed to express our state of knowl- 
edge or ignorance about Dtruo before any data are obtained - 
in our case the data being our ordered set of D sin i values for 
the sampled stars in the cluster. Note that this postulate im- 
mediately represents a departure from our earlier view that 
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Figure 7. Histograms of ordered cluster distance estimator distributions, derived from 2000 cluster samples at a true distance of 200 pc. 
Results are shown for a sample size of 10, 20 and 30 stars, and for an angular diameter error dispersion of = 0.05 and 0.1 



i?truc was a fixed, although unknown, parameter; we are now 
choosing to regard both Dsini and Dtruc itself as random 
variables. Next we introduce a model which describes the 
probability of observing the data given the parameter value, 
DtTuo - in other words the expected distribution of ordered 
Dsini values in a sample drawn from a cluster at a given 
true distance. 

The essential idea of the Bayesian approach is to com- 
bine these two distributions - our prior distribution for Dtruc, 
and the D sin i distribution given Duuc - to derive a posterior 
distribution for Dtmc, which expresses our state of knowl- 
edge of -Dtrue after we have measured the values of Dsini. 
The form of the posterior distribution is given by Bayes' 
theorem, and can be stated in the present context as:- 



p{Dtiuc\data) = k p(data| Aruc)p(Aruc) (19) 

Here p(-Dtruc) denotes our prior distribution for the true clus- 
ter distance, and k is a normalisation constant which ensures 
that the posterior distribution integrates to 1. 

The Bayesian formulation is not so far removed from 
the ideas which underpin our definition of -Dord in Section ^ 
The concept introduced in equation (17) of a likelihood func- 
tion, X(r), for Dtiua given the measured value of Dsinij^), 
has already alluded to the fact that we could regard Dtruc 
as a random variable. The Bayesian viewpoint extends this 
interpretation of the likelihood function to derive not sim- 
ply a point estimate but rather a probability distribution 
for -Dtruc, in the light of the observed data. In the same 
way as in section W, then, we can regard the probability 
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distribution, p{data\Dtruc) , on the right hand side of equa- 
tion (20) as a function of Dtruc: i-e. a likelihood function, 
L{Dtiuc\data), for Dtruc given the measured Dsini values. 
The likelihood function clearly plays a crucial role in deter- 
mining our posterior distribution: it is the function through 
which the observed data modifies our prior knowledge of 
Dtruc, and can therefore be thought of as representing the 
information about Dtruc which comes directly from the data. 

In the present context L{Dtiua\data) is, therefore, given 
by the joint distribution of ordered Dsini values expected 
in a cluster at true distance, Dtruc, viz:- 

L{Dtruc\data) = pi...„(_Dsini(i), . . . , D sin |Dtruo) (20) 

It follows, moreover, from equation (9) that we can rewrite 
this as:- 



L{Dtiuc\data) = pi...„(q:(i) , . . . , 
Where, of course, ai^.) = 



for all r — 1, 



(21) 



as be- 



for e. A general ex pression for this joint distribution is given 
in David (1981) in terms of the density function and cu- 
mulative distribution of a. It is important to note that this 
joint distribution will not in general be equal to A(D), as 
defined in equation (18) above, however. In other words the 
joint likelihood function for all of the D sin i order statis- 
tics is not in general equal to the product of the individual 
likelihood functions. This is because the sample Dsini val- 
ues are not independent of each other: the measured value 
of Z)sini(„), for example, must have a bearing on the dis- 



»(n), 

tribution of Dsini 



(n-l), 



since we now have the constraint 



that D sin i(n-i) < D sin i(„) , and so on for the smaller order 
statistics. 

We now illustrate the application of this Bayesian ap- 
proach to some typical cluster samples. Figure ^ shows the 
posterior distribution for Dtmc obtained from an ordered 
sample of Dsini values, computed for sample sizes ranging 
between 10 and 30 stars. All samples were drawn from the 
simulated D sin i distribution of a cluster at a distance of 
200 pc, and with ct^ = 0.05. In each case the prior distribu- 
tion for Dtruc was taken to be uniform within the range 150 
to 250 pc. 

We can see from Figure ^ that the Bayesian approach 
yields results which are broadly consistent with the estima- 
tor distributions derived from Monte Carlo sampling in Sec- 
tion ^ With a sample size of 10 stars, for example, we find 
that the posterior distribution has a dispersion of ~ 10 pc, 
which is consistent with the error estimates derived for Do^d 
in Table |l|. One can, of course, also derive Bayesian confi- 
dence intervals for Dtrue directly from the posterior distri- 
bution. 

It is clear from Figure ^ that the posterior distribution 
becomes 'sharper' as the sample size increases - indicating 
a more accurate Bayesian estimation of the true distance 
from larger samples, as one would expect. Moreover, we find 
that there is little further reduction in the dispersion of the 
posterior when one's sample contains more than 30 stars 
- which is also analogous to the results of Section ^ 

One need not carry out our Bayesian reconstruction 
using all n order statistics: one can use any single order 
statistic, or any subset of the full range of orders, with re- 
sults which accord with the distributions of the order statis- 
tics used. The posterior distribution recovered from using 
only the smallest Dsini value, for example, is considerably 



poorer than those shown in Figure ^ this merely refiects the 
relatively large dispersion of the a(i) distribution - as can be 
seen in Figure ^ above. In general we find that - for a given 
sample - the posterior of smallest dispersion is obtained by 
using the complete ordered sample of D sin i values. 

The choice of prior distribution in the application of 
Bayesian methods is clearly very important. The use of a 
non-uniform prior is the source of considerable controversy 
in the statistics literature, since it is often alleged that such a 
prior prejudices one's results by, as it were, forcing the data 
to say something different about Dtruc than the information 
which they in fact contain. In recognition of this point, a 
uniform prior seems to us to be a more appropriate choice. In 
this case it is solely the properties of the likelihood function 
which determine the form of the posterior distribution - and 
not some predisposed view of what the true cluster distance 
'should' be. 

Nevertheless, a powerful feature of the Bayesian ap- 
proach is the fact that one can take the posterior distri- 
bution recovered from a given cluster sample and adopt this 
as a new prior, in order to determine an 'updated' posterior 
distribution in the light of new data - i.e. as Dsini values 
are measured for additional stars in the cluster. We will in- 
vestigate this extension to our analysis in future work. 



6 CONCLUSIONS 

In this paper we have presented a new method for determin- 
ing the distance to nearby open clusters. The method is ap- 
plicable to clusters containing fast-rotating late type stars, 
whose rotation periods have been measured from detecting 
the rotational modulation of surface inhomogeneities. The 
period of each star is then combined with its projected rota- 
tional velocity and an estimate of its angular diameter, in- 
ferred from the Barnes-Evans relation, to form an estimate 
of the projected cluster distance, Dsini. 

We have shown how one may then combine the set of 
Dsini values inferred from each star in the cluster sample 
to form an 'ordered' distance estimator of the true cluster 
distance. We have investigated the properties of this estima- 
tor using Monte Carlo simulations of cluster samples, after 
careful modelling of the intrinsic scatter in the Barnes-Evans 
relation and the observational selection effects to which the 
samples are subject. We have shown how one may apply 
this distance method to real data samples, in order to de- 
rive error estimates and confidence intervals for the true 
cluster distance. We have also demonstrated that this new 
method is amenable to a Bayesian analysis, and have again 
illustrated this approach using artificially generated cluster 
samples. 

We have found that, for realistic models of the random 
variables, the accuracy of our distance method is between 
~ 3 and 7% (at the la level) - depending on the size of 
the cluster sample - which is comparable with the precision 
of distance estimates obtained by ZAMS fitting techniques. 
Since it is subject to a different set of systematic errors and 
model assumptions than ZAMS fitting - and in particular 
does not rely upon a zero-point calibration by a single clus- 
ter, usually the Hyades - these results indicate that our new 
method can play a useful and important role alongside more 
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Figure 8. Examples of the posterior distribution for the true cluster distance, derived by applying the bayesian method described in 
the text to simulated cluster samples containing between 10 and 30 stars. In all cases the simulated samples were drawn from a cluster 
at a true distance of 200 pc 



traditional cluster distance indicators in better determining 
the local distance scale. 

In a forthcoming paper, currently in preparation, we 
will apply our new distance method to real data samples 
taken from the o Per and Pleiades clusters, and explicitly 
compare the results of our method with those obtained by 
the ZAMS procedure. Other possible future applications of 
our method include a better calibration of the Barnes-Evans 
relation - and determination of the inclination distribution 
- for fast rotators in young open clusters, by comparing our 
results with the very accurate cluster distances soon to be 
provided by the Hipparcos satellite. 
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